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ABSTRACT 


The optimal time-sequential distribution of supporting 
fire against enemy ground units in combat against attacking 
friendly units is studied, Lanchester-type models of war- 
fare are combined with optimal control theory in this in- 
vestigation. The optimal time-sequential fire-support 
policy is characterized for a specific problem. Although 
complete details for the determination of the optimal policy 
are not Piven, 1t71S conjectured, on the basis of the 
theorems which were proved, that for this problem the optimal 
policy is to always concentrate all supporting fire on the 


Same enemy unit (until supporting fire must be lifted). 
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I. INTRODUCTION 


The problem of fire support allocation is a problem of 
interest to the military tactician and planner. In this 
paper, we will study the problem of artillery fire support 
allocation against several enemy ground units. The problem 
LS eto determinevan optimal time-sequential policy for dis- 
tributing available fire support without wasting it. By 
Wasting of firé support we mean overkilling, i.e., the 
destruction of enemy forces that does not contribute to the 
attainment of the friendly objective. Overkill would cor- 
respond to a state variable which Represents themstrenath 
of a ground unit becoming negative. We do not want to over- 
kill because it will be simply wasting the effort. 

To decom ae 2 “food allocation of fire#®support, ene 
must consider the dynamic nature of combat in which opposing 
forces trade casualties over a period of time. Lanchester 
type models of warfare (see [15]) have been developed to 
provide insights into the dynamics of combat. Although com- 
bat attrition certainly is a random process, such a formula- 
tion is not mathematically tractable as far as analytic work 
feomcolccincad, = inercfore, we will consider deterministic 
Paucnasret= type models in our work here, These are differen- 
tial cqwuattonmmedels of the combat tattrition process. With 
such a dynamical system model and considering that fire sup- 


port decisions are made over a period of time, one is led 





to an optimal-control/differential-game formulation for the 
determination of optimal time-sequential fire support 
allocation. 

Such time-sequential combat optimization problems may 
be cast as either optimal control problems or differential 
game, depending on whether or not both sides are modelled 
as rational decision makers. In the exploratory work at 
hand, we will consider the optimization of combat decision 
by only one of the combatants. This leads to consideration 
of an optimal control problem. Such a problem has been 
referred to as a Lanchester-type optimal control problem by 
Taylor (see [14]). 

In a deterministic combat optimization problem, we usually 
pucoUmter ciuther aestatic or dynamic situation, If the sat- 
uation. If the situation is static, we may determine the 
optimal policy using the classical techniques from advanced 
calculus for problems that are either unconstrained or sub- 
ject to equality constraints. If there are inequality eer 
Straints in the optimization problem, the methods of nonlinear 
programming may be used. If the combat situation is dynamic, 
we may apply the methods of the classic calculus of variations 
for problems that are either unconstrained or subject to 
Cquality constraints (see [4]). However, if inequality con- 
Sitraints) are invoilved, it»may be necessary to apply the 
RecCiUmrecSMILS Of optimals control theory. 

The development of solutions to Lanchester-type optimal 
Sitoienroplems 15 not a routine matter due to several tech- 
Diedieciitrteultrese Dile to physical considerations force 
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levels cannot become negative, and we must introduce state- 
variable inequality constraints (SVIC's), Ce. no state 
variable (force level) can be allowed to be negative. All 
Lanchester-type dynamic tactical allocation problems have 
SVIC's. The maximum principle (in its original form) is 
inadequate to solve a problem with SVIC's, since an ex- 
tremal may contain a subarc which lies on the boundary of 
the state space. By an extremal we mean any trajectory on 
which the necessary conditions of optimality are satisfied 
for alj t. When SVIC's are present in a problem, the 
necessary conditions (i.e., Pontryagin's maximum principle) 
need to be modified (see [7]). To simplify considerations 
for the problem studied in this thesis, we will assume that 
conditicns are such that ali state variables are aiways 
non-negative. Thus SVIC's are not considered further im 
this thesis. 

ANOtNeT GitLiCuliyeeormtnmis tCYpeuolsprobiem is@the pos- 
Sible presience of a singular subarc. Consider a control 
problem in which, for example, a single control variable u 
appears linearly in the Hamiltonian H. The maximum prin- 
ciple fails to determine the optimal control when dH/du = 0 
LOmeaomeinntwe interval Oofstime. Ihe Corresponding trajectory 
Pmedeledsa sungular subare. Use of the terminology singu- 
figets Motivated by themiact that in such problems the matrix 
ef Secondgpartial derivamuves of the Hamiltonian with respect 
BOmtcmeontrolevariables¥is a singular matrix. We will call 


cot endaiesimoular 1£ it cont@ins a singular subarc. 





We will study the problem at hand in two phases. In the first phase 
we will consider nonsingular extremals, while in the second phase we 
will consider singular extremals. One result of the study for this 
problem is that there is no nice algorithm to determine the optimal 
control for our problem. In general, combat optimization problems 
Aogline with multiple units are extremely complicated, especially if 


there are some Singular subarcs. 





ieee | Dove RY BERE oUPPORT ALEOCATION MODEL 


Consider combat between two ground forces Red and Blue. 
Each side is composed of two infantry units. Blue infantry 
mmits are) being supported by artillery fire. 

We want to find the best policy for allocating Blue's 
artillery fire support against Red to achieve the maximum 
Btfectiveness. the measure of effectiveness is taken to be 
Ete ratvo of Blue infantry forces to Red at the given time. 
The problem is to determine an optimal policy by which the 
PimtomOotetwOomtitantry L£LOrcessatethe end of the battle can 


: be maximized. 











The Blue forces will be denoted as Xu and X19 (with 
corresponding force levels x4 (t) and X17 (t)) and the Red 
forces as Xo4 and X59 (with corresponding force levels 
X54 (t) and X59 (t))-. 

The following assumptions will be made for developing 
the model. 

1. Each infantry unit is only in contact with his op- 
posing unit. 

2. The Blue infantry units are moving to contact with 
the Red units and due to their movement cause negligible 
attrition to the’ opposing Red forces. Furthermore, the 
Red defensive positions are prepared and relatively invul- 
nerable to small arms fire during this approach phase. 

3. Tne defensive Red units cause attrition to the op- 
posing Blue units according to a "square law" attrition 
process. 

4. The Blue artillery delivers area fire against the 
Red infantry. 

With the above assumptions we may develop the following 


combat optimization model: 


sgl 4 X12 M1) 


mMmaxXlMm1iZe J = “1x5, 0) 3 2X55 CW) 
Subject to: ee) = -a,X51(t) 


X19(t) = 1G 5hq.9 (8) 


k(t) = -Ku(t)xy, (0) 


see) ere elmer t))) x5, (¢) (1a) 
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where x..(t): force level at time t 
x..(t): the time derivatives of-state variables mee) 
T: time at which the battle terminates 
u(t): control variable 
eat ebletoOMmeratomeoctraClents 


k.: utilities assigned to ratio of survivors. 


SimcCcenmtniewcrilterzon Lunectional contains torce ratios at 
the terminal time, we may formulate a problem with a state 


space of lower dimension by defining new state variables as 


oq 


— = X 


o7 = 
dL. X99 


2 


— 


2° 


o 
> 


Then we can show that (la) is equivalent to the following 


optimal control problem: 


maximize J = kx, @) ot k,x, (1) 
subject to X, = -ay + CyUX, 
X5 = -a, + c,(1-u)x, (1b) 


where all constant coefficients remain same as (la). 


Let us assume that we are given the initial conditions 


A Ke a Xe 
x, (0) =X, > x, (0) Xo. 


Let us assume also that we are given a specified time Ty Wise ta 
meecolleastine duration of the combat. We will aissume that 
meme battle will be terminated at time T under one of the 


following conditions: 
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Condit) ON ie: x, (7) = x, (T) = 0, for some T<T,. 


Condition 2: T=T,. 





Pig’, DEVELOPMENT OF NECESSARY CONDITIONS OF OPTIMALITY 


To develop the necessary conditions for optimality, let 


us define the Hamiltonian for the problem (1b), 
H(t,x,p,u) = py(-aytcyuxy) * po(-aytcy(1-u)x,)3 (2) 


Py and Pp, are the adjoint variables, corresponding to the 


State variables Xy and Xo» respectively and, satisfying the 


differential equations: 


Te ae 
dt ax 1Py 
dp 
a eo oe 
Tt ~~ 3x; 7 (igu)p.- (3) 


By Pontryagin's maximum principle, we can show that the 
nonsingular optimal control will be obtained by maximizing 
the Hamiltonian over the control variable. Hence we may 


poserve that the extremal control isjthen given by 
0 iit. So 
u(t) = iL tf S + 0 
PTGeteriilatew Ls: o =) 0), 
Dnere Sa CiPy%X1 - CoPoX>- 


ne Willi vagal) u@the extremal control function and $& the switch- 
ing function. The extremal control u so defined may furnish 
the global maximum, a local maximum, or only a stationary 


malities tor the criterion function. 
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Ssamce the Hamiltonian TH 1s a linear function of the con- 
meolowarianle UU, if the iswitching function 5S becomes iden- 
mictliye zero ior tsome finite interval of timel, tuhe Maximum 
principle fails to provide any information about the optimal 
pontrol u. In this case all the admisible values of u 
maximize the Hamiltonian; therefore theory of singular ex- 
tremals is required to solve this problem. If S = de 1 Se 
be identically zero for some finite interval of time, then 
all its derivatives with respéct to the time must also van- 


ish. That is, the trajectory remains on a singular subarc 


if and only if 


eS eS oe eee (5) 


From these conditions, we will have 


0 = aa S C4P4Xy 7 CoP 5X4 (6) 
eon. oo 
bee de © am me Sy A2°2P 2 i) 
d H + 
0 = aor ( 2) = Bo Aeon > epee in 0) (8) 


From (6) - (8), we have 


aa (9) 


aeeinane See tne tsingullar control. 
ihe eeneralized Legendre-Cllebsch condition is nécessary 


mor simeular subarc to yicld a maximum return (see [9]). 


LS 





q2k 


k oO oH 
ann Magetk (Fg ee Os 


The condition will be given (-1) 


the problem at hand, 


9 d2 oH 7 2 2 
gu | aez © ay) J = 4c Py * 42¢9P2 2 9; 


since p, 2 0. Thevetlore. wc Legendre-Clebsch condition is 
satisfied. 
For the singular subarc S = S = 0; it follows that 
x, (t) R an i 
Ey 
We will refer to the surfaces So defined by (10), and Sy? 
defined by (7) as singular surfaces. Singular arcs lie in 


mnesewe Lhe tsurface defined by 


= Sie we ne ale 
Ps called the switching surface. 

The surfaces or curves =), and Se can be drawn in the p 
and x space, respectively, but the switching surface S in- 


volves both p and x and we cannot visualize it easily. 
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IV. SYNTHESIS OF EXTREMAL CONTROL 


A. SYNTHESIZING FUNCTION 

Let X be the Euclidean space on which the state variables 
x and their associated adjoint variables p are defined, and 
U be the Euclidean space for control variables. Suppose we 
Sane tind® a realevalued function g: X+U such that x ="£(x,2¢(x,p)) 
determines all the extremal trajectories from initial to ter- 
mba! eLmec meinen the Optimal control functionwmay be explicit-— 
ly determined by the function of state variables and adjoint 
variables such that the optimal control at each time depends 
only on the points of the space at which the state variables 
and the adjoint variables are located at the given time. 
phesfuncrionec(x,p) will be called’ a synthesizing, function 
and the explicit determination of such a function is called 
the synthesis of the extremal control, and the explicit de- 
Meoriilactlom OL the time history of the extremal trajectory 
moecal ledssynthesis of extremal trajectory. 

Pitroreumately, sanwexistence theorem for the synthesizing 
Mmcelon has! Met bDeeneproveas, Furthermore, there Ws no gencrall 
LOL men a tommcne spaoblem of tangane Such a function. If the 
Scmeomrmeouiumalevalues for the sttate variables 1s not explicit- 
i, Speciicds it May be extremely dutficult to find a syn- 
thesizing function. The determination of the synthesizing 
Rice Loenm must bd done on a casc-by-cas@ basus, since there 1s 
no general algorithm. In the problem at hand the terminal 


BeaGee iss Not specificd. However, we know that the control] 


1 





depends upon the state variable x and the adjoint variable 
p, and it will be determined by the maximum principle when 
(x,p) does not belong to the singular surface. 

The difficulty is that x is only known initially and p 
is known only at the problem's end until we have managed to 
match x and p with the maximum principle to get a solution. 

However we may 2 ital ae the behavior of the switching 
function, which is a type of synthesizing function. Although 
it involves both state and adjoint variables we can infer 
its behavior as a function of time, along an extremal, to 
establish several results. Further the trajectory lies on 
Singular subarc for some finite interval of time, we may 
consider the singular surfaces Sy and Sy as a synthesizing 
function. 

B. SYNTHESIS OF CONTROL FOR A TIME INTERVAL DURING WHICH 

THE CONTROL IS CONSTANT 

For synthesizing extremal trajectory, let us consider 
the Benst ant control at first. Suppose we choose a constant 
optimal control from the beginning to the end of the battle. 
Then from the differential equation (1b), we will have the 


following solutions. 


For u = 1; 
a ce t a 
2 > ie 1 ae 
x, (t) = max [Oxy = Je eS 0 | 
iL 1 
x,(t) = max [ X5 Sapo | (12a) 
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P,(t) + 


P, (t) > 


S(t) =. 


S(t) = 


S(t) 


For u = 0; 
x,(t) = 


x, (t) = 


Pp, (t) 


P, (t) 

S(t) = 

S(t) = 

SiGe = 

For u-= 8 = 
x, (t) 


x2 (t) = 


Pp, (t) 


P5(t) = 


ky 
ko 
c,(T-t) 
czk x1 - C2k,X, 
c,(T-t) 
“a,c ,k,e + a5Cok, 
ieee 


max { (x, =e es 0} 
2 yi, 
iS] 
c,(T-t) 
ke 
c,(T-t) 
Sigil | Ce ae 
c,(T-t) 
-a,c,k, + a,cok,e 
c,(T-t) 
Ie: d 
a4c5k,e 
2 
+ Co 
a a 
max { (x; - = yore + , O} 
a a 
max { (x5 - — je%* + -£ , 0} 
k ao (T-t) 
k Batt) 


(12b) 


(iZe) 


(12d) 


(1265) 


(13a) 


(13b) 


(13c) 


(13d) 


(13e) 


(14a) 


(14b) 





Gul=t) 


S(t) = (c,k,x; - cok, x,)e (14c) 
S(t) = (-a,c,k, - a,c,k,)e%F-®) (144) 
e Oo see: ; 
S(t) = at e (aycjk, - azc,k,) (14e) 
where 
Cac 
a 
a 


We will use the above results to determine the optimal 


~ 


policy for more general cases. 

C. DEVELOPMENT OF THEOREMS ON THE CHARACTERIZATION OF AN 
Ori MAL EOLnTGY 
By analyzing the behavior of switching function, we may 


establish the following theorems. 


THEOREM 1 
Let E be a maximizing extremal which contains no singular 
subarc for the optimal control problem given by (1b). Then 


there is at most one switching in E. 


By hypothesis, since E does not contain a singular sub- 
arc, the optimal control can be determined by Pontryagin's 
maximum principle, except possibly at one point, as we shall 
see. We have got the switching function from the Hamiltonian 


by means of the maximum principle, 


S(t) = Cy PyX1 > €oPX>- 
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Then its derivative is 


S(t) = ag¢op, - ayCyPy 
and 
S(t) = a,CjP,u - a,c5p,(1-u). 


It follows that 


S(t) > 0 if u 


i 
_ 


S(t) <0 if u 


I 
a) 


Since the trajectory does not lie on singular subarc, we will 
have either u = 1 or u = 0 at t = 0. Let us consider the case 
u= 1 at t = 03; a similar discussion will hold if u = 0 ini- 
ey ote el = vat t =™ 05 5(0) > 0. 

1. Let us consider first the cases in which S(0) > 0. 


There are two sub-cases, accordingly as S (0) Za ei SO) << We 


Case 1: S(0) > 0 and $§ (0) > 0. 
Since S(0) > 0, we must choose u = 1 at the beginning of 


c, (T-t) 
the game. From (12b), Pp, (t) = kie : Pp, (t) = k,. There- 


2 
fore, p, (t) is decreasing function of t and P, (t) is constant. 
Hence S(0) > 0 implies S(t) > 0 and S(t) > 0, for all t > 0. 
Therefore S(t) is a monotone increasing function, and conse- 


quently we will have S(t) > 0 for te[0,T]. Hence no switch- 


ing occurs. 


Case 2: S(0) > 0 and S(0) < 0. 
From equation (12c), S must have the form 
-c,t 


nae aie ae | (15) 
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in which A and B are positive, so long as u=1. But if S 
becomes negative u must become 0 and S will have a different 
form. 

Let t be the value of t for which D = 0. Consider also 
S: S may become zero at some value t, < t or S(t) may be 


greater than or equal to zero. There are three subcases. 


DS 


Subcase 1 ’ , 

It may be that S > 0 for all t < t. In this case S > 0 
for all t, since S > 0 when u = 1. -Hence no switching occurs. 

Subcase ii 

It may be that S becomes zero at some time t, < tT and 


S(t,) oe Unhenetiemes  SRagswitching atet= ty: From the 
0 it fol- 


facts that S(t,) = ORSIGCED) < 0 and S < 0 when u 
lows that ty is a switching point, and that S < 0 for all 
t > t,- 

Subcase ili 

It may be that ty in case 11 coincides with Tt, so that 
= 0 and S(t 


S(t = 0 for some t} < T. Then the maximum 


1) 1 


Ze 


principle does not determine the control at time t1- If we 
choose u= 1 att= ti» when Sea) > 0. It follows that 
S(t), S(t) are positive for t > ty and there is no switching. 
If we choose u = 0 at t = ti> then by the above arguments 


S(t) < 0 for t > t, and there is one switching. 


1 

2. It may be that S(0) = 0. If S(0) > 0 then S(t) is 
always positive. If $(0) < 0 then S(t) < 0 for t > 0 and 
ie Oe Pinal ly Ft S(0) = 0 then we may choose u(0) arbi- 
trarily. If we choose u = 1 initially then S > 0 for t > 0 
by argument given in case 2-iii and there is no switching. 
Similarly, if we choose u(0) = 0, S remains negative. 

This takes care of all extremals which do not have a 


Singular arc; a nonsingular extremal has at most one switch- 


ang pornt. 


COROLLARY: 
If E contains a singular subarc, then there are at most 


two switchings. 


Suppose S(0) > 0 and there is ty > 0 such that S(t,) = 
S(t,) = 0. We will show that there are at most two switch- 
ings for this case. The case for S(0) < 0 will be symmetrical 
to the case of S(0) > 0. Since we will choose the singular 


control at t,, we may assume that the trajectory will remain 





on the singular subarc until some time t,. If t, 2 T, we 
Cc 
will have only one switching at ty from u = 1tous= Z . 
c,tc 


HN &O 


Suppose t. < T. Assume that at some time t, we choose u 


(i) 


when we are on the singular subarc. Then, by the argument 
given in 2-iii, S will become positive and remain positive 
horedal ls te > t,. Similarly if we choose u = 0, S will be- 
come negative and remain negative. Thus we will have at 
most two switchings, one at t,. say, when the extremal enters 


the singular arc and one at t, when it leaves it. 


THEOREM 2 
Consider a path E10 defined by the constant control 


ceo. t < £ and u = 0, t, <t< T, so that E10 has 


hi? 


one switching and no singular subarc. Let J,, be the as- 


10 
sociated criterion function. If Xj > a,/c, and X, > a,T, 
then there is no maximizing extremal of the form E10: By 
Symmetry, there cannot be one of the form Fol in which 


Xs > a,/c, and x4 > a,T. That is, a properly chosen constant 


control is always better than a mixed policy of this type. 


Proof: 
Suppose we choose the extremal control u = 1 for te[0,t,], 


and u = 0 for te[t,.T]. Then from (12a) and (13a), we have 


X,(t,) = max { (xy - —=)e + — , 0} 


X,(t,) max { X5 - aot,), 0}. 
For te[t,,T], 
x, (t) = max {x9 (t) - a, (t-t,),0} 
ig onal 


Or ae nanan es a 
max {x4 cy Je + ; a,(t t,),0} 


z4 


a | : 





7 Se 


| a Ce it=t.) a 
x(t) =Smax{ (x3 (t.) et ee = oe 0 
ib Zor A Co c 
2 
a cs (=e) ) a 
Es _ _ mee 2 1 ws 
max{ (x, aot, co )oe + cy O}. 


Suppose x, (t), x, (t) > 0 for all te[0,T]; then the payoff 


will be 
a ops a 
a oe LoL eee. . 
S40 k(x, cy je + oA a, (T x4) ] + (16) 
a c,(T-t) a 
_ _2 4 _2 
gC a erie = 
a adh 6 a c,(T-t,) 
t = 0 _ ee 1-1 Ut = 2 1 Cz 
ome 11 oc, e pe, ee 8 (X2-agty) 
a Cot e,(T=ta) a 
1 2 o | a 1-1 2 Z 1 Py _& 
dag > Fae ales a Me 262 ee) 
#3) x5 if 
Since x4 > — and T < — , we always have J,, > 0. Therefore, 
- Cy - a, 10 
we will have a maximum payoff at the boundary value of ti> 
H.C. ty = 0 or ty = T. 
If t, = 0, 
ao Col ao 

max Jig = Jg = k, (x -a,T) + k, [x3 - om )oe + c ] 

et t, = 1, 


. ay cyl 
max J 10 = J 1bX4 - an JS 


il 
ray 


+ oA ] + k, (x5 = aT). 


In other words, we cannot improve the payoff by switching the 
control. We will have the same result for Jor: for which we 


Switch the control from u = 0 to u= 1. We may summarize 


he 





this in the following corollary: 


COROLLARY: 
All nonsingular controls which maximize the criterion 


function for this problem have the form u = 1 or else u = 0. 
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V. DETERMINATION OF THE OPTIMAL CONTROL 


A. SUFFICIENT CONDITIONS FOR OPTIMALITY 

As is well known the maximum principle only provides 
necessary conditions of optimality. Second order sufficient 
conditions (see [1]) are impractical to apply to the problem 
at hand. Mangasarian's sufficient conditions (see [10]), 
unfortunately do not apply to the problem at hand. 

The following is an adoptation of Mangasarian's Theorem 
to a form more closely related to the specific problem under 


study. Consider the following optimal control problem. 


Ib 
Maximize ( PCt,x,u) dt + G(x sty) 
U. ui a 
O 


SRIDsICCimetO = - f(t ex Ul). 
g(t.x§u) > 0, 

the initial conditions C(x) Os 
the terminal conditions C, (X¥) = 0 
where x, = a 0) a Xe = x(T) for specified T. 
THEOREM: 

Assume that F(t,x,u) and each component of f(t,x,u) and 
g(t,x,u) are differentiable and concave in the variable (x,u) 
for te[0,T], and also each component of C,(%) and C, (Xx) is 


differentiable and concave in Xo and Xp respectively. Then 


a control u(t) yields a maximum for problem (17) if there 


Ze 





exist U(t), x(t), p(t), q(t), T(t), S(t) such that; 


H(t oe U,p.G.6 Sime a(t, x,u) + p £(t,x,u) a qi g(t,x,u) 


+ rC,(x,) + SC, (X¢) 


oe —T ee = me to = 
ee ee ee Cm Pi) 
H = F (t,x,u) + p £ (t,x,U) + qé-g (t,x,u) = 0 
u . u Melee J p u 3 “~3 G 8y ba | 
Bx Xo %e) " rC, (x,) | CO) 5-80 
Bx, Si? Xe + sC,(X¢) - p(T) = 0 

iy aes, 

rC, (x,) = 0 

s > 0 

SC, (Xp) 0 

q > 0 


However, for the prep ie at hand (1b) it is readily seen 
that the right-hand sides of the differential equation are 
not concave functions of the state and control variables so 
that the above theorem cannot be applied. We will return to 


discuss our problem. 
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B. DETERMINATION OF OPTIMAL CONTROL IN TERMS OF INITIAL 
CONDITIONS AND TERMINAL CONDITIONS 


In general we cannot determine the optimal control with- 
out some computation and comparison. However, in some cases 
we can establish its form easily. In this section two theo- 
rems are proved. The first one establishes a sufficient 
condition for some nonsingular arcs to be optimal. The 
second theorem establishes a necessary condition: An opti- 


mizing arc never has a terminal singular subarc. 


THEOREM 3 
a 
If E does not contain a. singular subarc, and if x, > — 
a ep: 1 
and 0 < T < mint ee on — }, thien constant control 
aE 2.07 a 
u = 1 is the optimal control. By symmetry, if X5 2 4 and 
| xf 4 Cah X5 2 
oa. T < ae, t oa &£n ck xg then the constant conan 
u = 0 is the optimal control. 
Proof: 
il 
; Let E be a nonsingular extremal for which x, 2 ¢— and 
x H 
— > T. Then, by theorem 2, we know that an optimal control 
2 
is constant; either u = 0 or u =1. Now assume S(t) < 0 


Hor all tel0,Tl. 


. ‘omy 
From (13c), S(0) = -cyk5e : X5 + c,k,x} < 0. Hence 
aks 
1 leader wat 
T > C, &n Cghgxt . Therefore, by contrapositive argument, 


. roe '0 0 — 

Tht RS (1/c,)&n(c,k,x}/c7k5x5), then S(0) > 0. Thus u 0 
is not an optimal control, consequently the optimal control 
is u= 1. A similar argument establishes the symmetrical 


result. 


aye 





THEOREM 4 
A maximizing control does not have a singular subarc 


ending at t = T. 


Proof: 
Assume that we have apath which has singular control at 

the end, say, at least in an interval ty < t< T. Let us 

consider At =T - t, to be small. We want to compare this 

path with paths which have other controls in the interval 

(tT). In particular we want to compare the three trajectories 

which have the controls u = s, u=1, or u = 0, in the final 

interval (t,,1). 


Consider the Taylor expansions of J at t = t,- 


dJ 1 d* J. , 
Us ge de, ( “at Je," 2 ( ice Jy cat + O(At?} 
dJ, 1 d*J, . ; 
Jy = Met, TO Sarg Je, Se ey Sn a he 
dS, 7 d?J) 
linn se, 2 (ge yy cat Ss ) Nu + O(At*) (18) 


Since we assume that the trajectory lies on the singular sur- 
face at the end of the battle, to have singular control on an 
admissible extremal at time T, it is necessary that 

S(T) = S(T) = 0. 


From (6) and (7), we have 





a,c,k, = a5Cok, i.e ae ik : 
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Also 


in the interval of singular control. Then at time ti» this 
condition must be satisfied. 
From (10), (12a), (13a) and (14a), we may write the 
payoff 
awt c a a a(T-t,) 
= 2 2 a 1. 1 1 
Jo =k, ¢ a. | aa xy ~ Je ] 
i t 2 
where x7 xX, (t,) > 0, X5 x5 (t,) Ep) 
Similarly, 
a Cc aC a Cc (T-t,) 
il | Lil i i! il 
J.g=e8k. [| — + x? - (T-t., ) (x — }e ] 
1 1 Cy Co 1 C5 1 1 Cy 
a,c e ac ee(1=t3)) 
in egal ‘e : eT _ Agel 2 il 
STM ete eX me Ch ty) tat Bx ame ve 
2 Z 2 
Their derivatives with respect to t, are; 
7 a ott.) 
roe 1. 1 1 
J. kicy (L5 = ne 
x ; a(T-t,) 
J. = kc yay - ae 
ac a eo. (T-te) 
> > 2 i ee Z 1 
Jy ky [ + C1, Oy > je ] 
a Geach =. 8) 
a ie eA a 1 1 
Jy kK cy Oy ¥ Je 
ifaec CaCl=t*) 
eX, .. " joe. elt Z 1 
foe tea «6 **3%, ~ “co Je 








ol 








a She ces) 
—_ me ie oo ty 
Hence, 
lim Je = kc ( iat xi - @g ) 
tT S md C4tC, 1 iL 


a 
. Tt — 2 4 s mill 
lim Jy kc, Oy C4 ) 


t,7T 
lin ede = koc.c. (x2 - — } 
t oT 0 FOS aaa Co 
iH 
' = t _ ? 
Since (Jide (Jide (J odes from (18) we have 
k eae 
= = 1 a ! = 2 7 oo i is 
Pome nee ese) Oy) OM tet tc, 
1: Vem | 
: ar ; ; a lea 
ae wlll = SN 2 tee) eG er = SF eS >O, 


Therefore, J, > Jes and Jo > Jes at least if T - ty Sees Ue 


it 
ficiently small. 

Thus we see that if a trajectory ends with a singular 
subarc at the end of the battle, it cannot be an optimal 
trajectory. 

From studying this problem we feel that following result 
is true. If so it would reduce this particular problem to 


-the two cases for the optimun, 1.2. -w0 Or U = la We 


were unable to prove this result or to find a counter example. 
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THEOREM 5 

A maximizing arc never has a singular subarc nor a corner; 
that is the optimal control is always either u=1oru = 0Q, 
depending on the given conditions. 


(Not proved.) 
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VI. DISCUSSION 


Considering the theorems given above, we Pee that if an 
extremal does not contain a singular subarc, the optimal 
control u* is (a) constant for all te(0,T) and is (b) either 
0 or 1. Which of these policies is better may be determined 
by direct computation and comparison of values of the cri- 
terion functional using the attrition-rate coefficients and 
the initial conditions. Thus, we see that in this case the 
optimal time-sequential fire-support policy for Blue is to 
concentrate all his artillery resources on one of the Red 
units for the entire duration of battle. It should be 
pointed out that even if an extremal contains a singular 
subarc, Blue's artillery fire will tonnes Still be 
concentrated on one of the Red units at the end of battle. 
This ‘ig nAcairet an optimal fire-support policy has been 
characterized as not terminating with a singular subarc. 

We have not been able to show which policy is optimal 
when the trajectory contains a singular subare for some 
period of time before the end of the problem. Thus, the 
complete explicit determination of the optimal fire support 
policy was not accomplished. However, the basis has been 
given for doing this in the future. 

It should be pointed out that the insights gained into 
optimal fire-support policies obtained from the model (1) 
are no more valid than the model itself. Thus the reader 


Should.be cautioned that the optimal fire-support policy 
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determined here may only apply to the problem ely. As 
discussed by Taylor (see [14]), a combat optimization prob- 
lem consists of the following chee parts: (1) decision 
criteria, (2) model of the planning horizon (e.g. conflict 
eoRUMMlNe raaware condition), and (3) model of combat dynamics. 
For different objectives or different combat dynamics, the 
optimal fire-support policy may well be different from that 


given here. 


aS 





VII. CONCLUSIONS 


For the problem considered sue, lls thesis, the optimal 
fire-support policy is piecewise constant with at most two 
SWitches. If one never divides fire between the two enemy 
units (singular control), then the optimal policy is con- 
stant and either 0 or 1 (i.e., concentrate all fire on one 
of the enemy units). Although complete details could not 
be worked out within the time constraints for this thesis, 
it is conjectured that this is the optimal policy (i.e., 
always concentrate all supporting fire on one of the enemy 
units). The reader should be cautioned that these conclu- 
Sions may only apply to the specific fire support problem 


studied here. 
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